Exploiting noise in array CGH data to improve detection of DNA copy number change

نویسندگان

  • Jing Hu
  • Jian-Bo Gao
  • Yinhe Cao
  • Erwin Bottinger
  • Weijia Zhang
چکیده

Developing effective methods for analyzing array-CGH data to detect chromosomal aberrations is very important for the diagnosis of pathogenesis of cancer and other diseases. Current analysis methods, being largely based on smoothing and/or segmentation, are not quite capable of detecting both the aberration regions and the boundary break points very accurately. Furthermore, when evaluating the accuracy of an algorithm for analyzing array-CGH data, it is commonly assumed that noise in the data follows normal distribution. A fundamental question is whether noise in array-CGH is indeed Gaussian, and if not, can one exploit the characteristics of noise to develop novel analysis methods that are capable of detecting accurately the aberration regions as well as the boundary break points simultaneously? By analyzing bacterial artificial chromosomes (BACs) arrays with an average 1 mb resolution, 19 k oligo arrays with the average probe spacing <100 kb and 385 k oligo arrays with the average probe spacing of about 6 kb, we show that when there are aberrations, noise in all three types of arrays is highly non-Gaussian and possesses long-range spatial correlations, and that such noise leads to worse performance of existing methods for detecting aberrations in array-CGH than the Gaussian noise case. We further develop a novel method, which has optimally exploited the character of the noise, and is capable of identifying both aberration regions as well as the boundary break points very accurately. Finally, we propose a new concept, posteriori signal-to-noise ratio (p-SNR), to assign certain confidence level to an aberration region and boundaries detected.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

O-27: Genome Instabilities in Preimplantation Development Leading to Genetic Variation between Tissues of Normal Human Fetuses

Background: Origin of midlife copy number variations (CNVs) between tissues in non-genetic diseases is unknown. Such genomic differences caused by post-zygotic events. They might either happen during the life or due to prevalent mosaicism in preimplantation stage. We aim to explore fetal mosaicism and its origins. Materials and Methods: Two apparently normal fetuses were achieved following the ...

متن کامل

P-243: Prenatal Diagnosis Using Array CGH: Case Presentation

Background: Karyotype analysis has been the standard and reliable procedure for prenatal cytogenetic diagnosis since the 1970s. However, the major limitation remains requirement for cell culture, resulting in a delay of as much as 14 days to get the test results.CGH array technology has proven to be useful in detecting causative genomic imbalances or genetic mutations in as many as 15% of child...

متن کامل

A Linear-Time Algorithm for Analyzing Array CGH Data Using Log Ratio Triangulation

DNA copy number is the number of replicates of a contiguous segment of DNA on the genome. Copy number alteration (CNA) is a genetic abnormality in which the number of these segments differs from the normal copy number, which is two for human chromosomal DNA. The association of CNA with cancer has led to a proliferation of research into algorithmic methods for detecting these regions of genetic ...

متن کامل

Accurate detection of aneuploidies in array CGH and gene expression microarray data

MOTIVATION Chromosomal copy number changes (aneuploidies) are common in cell populations that undergo multiple cell divisions including yeast strains, cell lines and tumor cells. Identification of aneuploidies is critical in evolutionary studies, where changes in copy number serve an adaptive purpose, as well as in cancer studies, where amplifications and deletions of chromosomal regions have b...

متن کامل

Dependent Laplacian Bivariate Shrinkage Estimator for Swpt Based Array-cgh Data Smoothing

Array based comparative genomic hybridization (array-CGH) has merged as a highly efficient technique for the detection of chromosomal imbalances. Characteristics of these DNA copy number aberrations provide the insights into cancer, and they are useful for the diagnostic and therapy strategies. In this paper, we propose a statistical bivariate model for array CGH data in the stationary wavelet ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2007